Skip to content

FEAT: Split-payload multi-turn attack strategy#1733

Open
ek0212 wants to merge 8 commits into
microsoft:mainfrom
ek0212:users/evekazarian/split-payload-attack-strategy
Open

FEAT: Split-payload multi-turn attack strategy#1733
ek0212 wants to merge 8 commits into
microsoft:mainfrom
ek0212:users/evekazarian/split-payload-attack-strategy

Conversation

@ek0212
Copy link
Copy Markdown

@ek0212 ek0212 commented May 14, 2026

Summary

Adds a split-payload multi-turn attack strategy to PyRIT. Split-payload attacks distribute a harmful request across multiple conversation turns where each individual turn passes all safety classifiers independently, but the assembled sequence constitutes a harmful instruction.

This is different from Crescendo (which escalates) - split-payload fragments a single harmful request across innocuous turns. The test: could you write a single harmful prompt that contains the same information as the full conversation? If yes, it's a split payload.

Changes

New attack strategy

  • \SplitPayloadAttack\ extending \MultiTurnAttackStrategy\ with three-phase execution:
    1. Decomposition: adversarial LLM decomposes objective into innocuous turns
    2. Execution: sends turns sequentially, verifying each passes per-turn safety
    3. Assembly audit: scores assembled conversation against original objective

Seed datasets

  • 60 split-payload attack conversations across 10 categories
  • 60 topic-matched benign hard negatives using vocabulary overlap

Design patterns

Four fragmentation patterns: component assembly, parameter specification, referential chains, context+technique split

Demo notebook

Shows configuring and running the attack against a target

Closes #1734

Testing

  • Unit tests needed
  • Integration test with mock target needed

@ek0212 ek0212 changed the title Feature: Split-payload multi-turn attack strategy FEAT: Split-payload multi-turn attack strategy May 21, 2026
@ek0212
Copy link
Copy Markdown
Author

ek0212 commented May 21, 2026

@microsoft-github-policy-service agree

…evekazarian/split-payload-attack-strategy
@ek0212 ek0212 force-pushed the users/evekazarian/split-payload-attack-strategy branch from 82fb4bf to 7773d92 Compare May 21, 2026 16:15
@ek0212 ek0212 marked this pull request as ready for review May 21, 2026 17:18
@ek0212 ek0212 marked this pull request as draft May 21, 2026 17:34
@ek0212 ek0212 marked this pull request as ready for review May 21, 2026 17:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

FEAT: split-payload multi-turn attack strategy

1 participant